A Data Quality Metamodel Extension to CWM
نویسندگان
چکیده
The importance of metadata has been broadly referred in the last years, mainly in the field of data warehousing and decision support systems. Contemporarily, in the adjacent field of data quality, several approaches and tools have been set out for the purpose of data profiling and cleaning. However, little effort has been made in order to formally specify metrics and techniques for data quality in a structured way. As a matter of fact, little relevance has been assigned to metadata regarding data quality and data cleaning issues. This paper aims at filling this gap, proposing a conceptual metamodel for data quality and cleaning, both applicable to operational and data warehousing contexts. The presented metadata model is integrated with OMG’s CWM, offering a possible extension of this standard toward data quality.
منابع مشابه
A Semantic Approach towards CWM-based ETL Processes
Nowadays, on the basis of a common standard for metadata representation and interchange mechanism in data warehouse environments, Common Warehouse Metamodel (CWM) – based ETL processes still has to face significant challenges in semantically and systematically integrating heterogeneous sources to data warehouse. In this context, we focus on proposing an ontology-based ETL framework for covering...
متن کاملA Standard for Representing Multidimensional Properties: The Common Warehouse Metamodel (CWM)
Data warehouses, multidimensional databases, and OLAP tools are based on the multidimensional (MD) modeling. Lately, several approaches have been proposed to easily capture main MD properties at the conceptual level. These conceptual MD models, together with a precise management of metadata, are the core of any related tool implementation. However, the broad diversity of MD models and managemen...
متن کاملIntegration and Reuse of Heterogeneous Information: Hetero-Homogeneous Data Warehouse Modeling in the Common Warehouse Metamodel
The corporate data warehouse integrates data from various operational data stores of a company. These operational data stores may be heterogeneous with respect to the represented information. The hetero-homogeneous data warehouse modeling approach overcomes issues associated with the integration of heterogeneous information from the operational data stores by featuring a generally homogeneous s...
متن کاملDas Common Warehouse Metamodel als Referenzmodell für Metadaten im Data Warehouse und dessen Erweiterung im SAP Business Information Warehouse
Heterogene Data Warehouse-Landschaften sind durch eine Vielzahl verschiedener Softwarekomponenten gekennzeichnet, deren Integration zu einer funktionierenden Business Intelligence-Lösung eine besondere Herausforderung darstellt. Die Metadaten der beteiligten Komponenten stellen dabei einen viel versprechenden Ansatz der effektiven und effizienten Verknüpfung dar, die aber durch die proprietären...
متن کاملUm Metamodelo para a Especificação de Data Warehouses Geográficos
The decision-making processses can be supported by many tools such as DW (Data Warehouse), OLAP (On-Line Analytical Processing) and GIS (Geographical Information System). Much research found in literature is aimed at integrating these technologies. However, the metamodeling of spatial and dimensional schemas for GDW (Geographical DW) is still an open question. In this context, this paper propos...
متن کامل